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We present a new decoding procedure to transmit classical information in a quantum channel 
which, saturating asymptotically the Holevo bound, achieves the optimal rate of the communication 
line. Differently from previous proposals, it is based on performing a sequence of (projective) 
YES/NO measurements which in A'' steps determines which codeword was sent by the sender (A'' 
being the number of the codewords). Our analysis shows that as long as A'^ is below the limit 
imposed by the Holevo bound the error probability can be sent to zero asymptotically in the length 
of the codewords. 



I. INTRODUCTION 

By constraining the amount of classical information 
which can be reliably encoded into a collection of quan- 
tum states [l[ , the Holevo bound sets a limit on the rates 
that can be achieved when transferring classical messages 
in a quantum communication channel. Even though, for 
finite number of channel uses, the bound in general is 
not achievable, it is saturated 0, [l] in the asymptotic 
limit of infinitely many channel uses. Consequently, via 
proper optimization and regularization Q , it provides the 
quantum analog of the Shannon capacity formula [H, i.e. 
the classical capacity of the quantum channel (e.g. see 
Rcfs. 0,0). 

Starting from the seminal works of Rcf. [J Q sev- 
eral alternative versions of the asymptotic attainability 
of the Holevo bound have been presented so far (e.g. 
see Refs. j7Hll| and references therein). The original 
proof 0, Q was obtained extending to the quantum 
regime the typical subspace encoding argument of Shan- 
non communication theory In this context an ex- 
plicit detection scheme (sometime presented as the pretty 
good measurement (PGM) scheme was introduced 

that allows for exact message recovery in the asymptotic 
limit infinitely long codewords. More recently, Ogawa 
and Nagaoka and Hayashi and Nagaoka [lO| proved 
the asymptotic achievability of the bound by establish- 
ing a formal connection with quantum hypothesis test- 
ing problem [l^, and by generalizing a technique (the 
information-spectrum method) which was introduced by 
Verdii and Han 14 1 in the context of classical communi- 



cation channel. 

In this paper we analyze a new decoding procedure for 
classical communication in a quantum channel. Here we 
give a formal proof using conventional methods, whereas 
in [15| we give a more intuitive take on the argument. 



Our decoding procedure allows for a new proof of the 
asymptotic attainability of the Holevo bound. As in 
Refs. [l, 3 it is based on the notion of typical subspace 
but it replaces the PGM scheme with a sequential decod- 
ing strategy in which, similarly to the quantum hypoth- 
esis testing approach of Ref. the received quantum 



codeword undergoes to a sequence of simple YES/NO 
projective measurements which try to determine which 
among all possible inputs my have originated it. To prove 
that this strategy attains the bound we compute its as- 
sociated average error probability and show that it con- 
verges to zero in the asymptotic limit of long codewords 
(the average being performed over the codewords of a 
given code and over all the possible codes). The main 
advantage of our scheme resides on the fact that, differ- 
ently from PGM and its variants [HI, [l6l - l26j . it allows 
for a simple intuitive description, it clarifies the role of 
entanglement in the decoding procedure [l5|, its analy- 
sis avoids some technicalities, and it appears to be more 
suited for practical implementations. 

The paper is organized as follows: in Scc.|ll]we set the 
problem and present the scheme in an informal, non tech- 
nical way. The formal derivation of the procedure begins 
in the next section. Specifically, the notation and some 
basic definitions are presented in Sec. IIIII Next the new 
sequential detection strategy is formalized in Sec. lIVi and 
finally the main result is derived in Sec. |Vl Conclusions 
and perspectives are given in Scc. lVIl The paper includes 
also some technical Appendixes. 



II. INTUITIVE DESCRIPTION OF THE MODEL 



The transmission of classical messages through a quan- 
tum channel can be decomposed in three logically distinct 
stages: the encoding stage in which the sender of the mes- 
sage (say, Alice) maps the classical information she wish 
to communicate into the states of some quantum objects 
(the quantum information carriers of the system); the 
transmission stage in which the carriers propagate along 
the communication line reaching the receiver (say. Bob) ; 
and the decoding stage in which Bob performs some quan- 
tum measurement on the carriers in order to retrieve Al- 
ice's messages. For explanatory purposes we will restrict 
the analysis to the simplest scenario where Alice is bound 
to use only unentangled signals and where the noise in 
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the channel is memory less Under this hypothesis the 
coding stage can be described as a process in which Alice 
encodes N classical messages into factorized states of n 
quantum carriers, producing a collection C of N quan- 



tum codewords of the form 177 := a^ 



a-i where 



Ji 7 ■ • ■ J jn are symbols extracted from a classical alpha- 
bet and where we use N different vectors j. Due to the 
communication noise, these strings will be received as 
the factorized states :~ pj^ ■ • • (8) Pj„ (the output 
codewords of the system) , where for each j we have 



(1) 



T being the completely positive, trace preserving chan- 
nel [28| that defines the noise acting on each carrier. Fi- 
nally, the decoding stage of the process can be charac- 
terized by assigning a specific Positive Valued Operator 
Measurement (POVM) ^ which Bob applies to pj to 

get a (hopefully faithful) estimation j' of the value j. 
Indicating with {Xj, Xq = 1 — J2jeC -^j} elements 
which compose the selected POVM, the average error 
probability that Bob will mistake a given j sent by Alice 
for a different message, can now be expressed as, e.g. see 
Ref. 0, 



(2) 



In the limit infinitely long sequences n — !> 00, it is 
known 0, 0, Pzl-fioj that P^rr can be sent to zero under the 
condition that N scales as 2"^ with R being bounded by 
the optimized version of the Holevo information, i.e. 



R max x{{P],P]}) , 



(3) 



where the maximization is performed over all possible 
choices of the inputs aj and over all possible probabili- 
ties Pj , and where for a given quantum output ensemble 
{pj , Pj } we have 



xi{P3,Pj}) ■■= Si^PjPj) - ^PjSipj) 



(4) 



with S(-) := — Tr[(-) log2(-)] being the von Neumann en- 
tropy [28| . The inequality in Eq. ([3]) is a direct con- 
sequence of the Holevo bound and its right- hand- 
side defines the so called Holevo capacity of the channel 



^ A similar formulation of the problem holds also when entan- 
gled signals are allowed: in this case however the aj defined in 
the text represents (possibly entangled) states of m-longs blocks 
of carriers: for each possible choice of m, and for each possi- 
ble coding/decoding strategy one define the error probability as 
in Eq. l[2]l. The optimal transmission rate (i.e. the capacity of 
the channel) is also expressible as in the rhs term of Eq. ^ 
via proper regularization over m (this is a consequence of the 
super-additivity of the Holevo information Q)). Finally the same 
construction can be applied also in the case of quantum commu- 
nication channels with memory, e.g. see Ref. 



T, i.e. the highest achievable rate of the communica- 
tion line which guarantees asymptotically null zero er- 
ror probability under the constraint of employing only 
unentangled codewords^. In Refs. [H, Q the achievabil- 
ity of the bound ([3]) was obtained by showing that that 
from any output quantum ensemble {pj , pj } it is possible 
to identify a set of ~ 2"^^^p^'P^^'> output codewords pj, 
and a decoding POVM for which the error probability of 
Eq. ^ goes to zero as n increases. Note that proceed- 
ing this way, one can forget about the initial mapping 
j — > (Tj and work directly with the j — >■ pj mapping. 
This is an important simplification which typically is not 
sufficiently stressed (see however Ref. [l3|)- Within this 
framework, the proof 0, Q exploited the random coding 
trick by Shannon in which the POVM is shown to provide 
exponential small error probability in average, when me- 
diating over all possible groups of codewords associated 
with {pj,Pj}. 



The idea we present here follows the same typicality 
approach of Refs. 0, Q but assumes a different detection 
scheme. In particular, while in Refs. 0, Q the POVM 
produces all possible outcomes in a single step as shown 
schematically in the inset of Fig.[l] our scheme is sequen- 
tial. Namely, Bob performs a sequence of measurements 
to test for each of the codewords. Specifically, he per- 
forms a first YES/NO measure to verify whether or not 
the received signal corresponds to the first element of the 
list, see Fig.[TJ If the answer is YES he stops and declares 
that the received message was the first one. If the answer 
is NO he takes the state which emerges from the measure- 
ment apparatus and performs a new YES/NO measure 
aimed to verify whether or not it corresponds to the sec- 
ond elements of the list, and so on until he has checked 
for all possible outcomes. The difficulty resides in the 
fact that, due to the quantum nature of the codewords, 
at each step of the protocol the received message is par- 
tially modified by the measurement (a problem which will 
not occur in a purely classical communication scenario) . 
This implies for instance that the state that is subject 
to the second measurement is not equal to what Bob re- 
ceived from the quantum channel. As a consequence, to 
avoid that the accumulated errors diverge as the detec- 
tion proceeds, the YES/NO measurements needs to be 
carefully designed to have little impact on the received 
codewords. As will be clarified in the following section we 
tackle this problem by resorting on the notion of typical 
subspaces [23: specifically our YES/NO measurements 
will be mild modifications of von Neumann projections 
on the typical subspaces of the codewords, in which their 
non exact orthogonality is smoothed away by rescaling 
them through further projection on the typical subspace 
of the source average message (see Sec. IIVI for details). 



^ See footnote [T] 
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RECEIVED 




Figure 1: Flowchart representation of the detection scheme; 
the projections on the typical subspace Ulypij) of the code- 
words are represented by the open circles, while the projec- 
tions on the typical subspace Hfyp of the average message of 
the source are represented by the black circles. The inset de- 
scribes the standard PGM decoding scheme which produces 
all the possible outcomes in a single step. 

III. SOURCES, CODES AND TYPICAL 
SUBSPACES 

In this section we review some basic notions and intro- 
duce the definitions necessary to formalize our detection 
scheme. 

An independent, identically distributed quantum 
source is defined by assigning the quantum ensemble 
£ = {pj, pj : j £ A} which specifies the density matrices 
Pj 6 ©(H) emitted by the source as they emerge from the 
memoryless channel, as well as the probabilities pj asso- 
ciated with those events (here j is the associated classical 
random variable which takes values on the domain A). 
Since the channel is memoryless, when operated n con- 
secutive times, it generates products states pj £ ©(?^®") 
of the form 

PJ := ® • • • ® Pj„ , (5) 

with probability 

Pj-=PjiPj2---Pj^, (6) 

(in these expressions j := (ji,--- ,j„) G A^). In strict 
analogy to Shannon information theory, one defines a 
A^-clement CODE C as a collection of A'' states of the 
form ([S]); i-e. 

C:={pjG6(?^«"):jeC}, (7) 

with C being the subset of A^ which identifies the ele- 
ments of C (i.e. the codewords of the code). The proba- 
bility that the source will generate the code C can then 
be computed as the (joint) probability of emitting all the 
codewords that compose it, i.e. 

n 

PiC) =1[PJ=I[I[P^^ (8) 



A. Typical spaces 

Consider p = ^jPjPj G ©(H) the average density 
matrix associated with the ensemble £, and let p = 
(7£|e£)(e^| its spectral decomposition (i.e. \eg) arc the 
orthonormal basis of H formed by the eigenvectors of p 
while qg are their eigenvalues). For fixed S > 0, one de- 
fines [1^ the typical subspace H^yp of p as the subspace 
of "H®" spanned by those vectors 

\ei) ■■= \ee,) ® ■ ■ ■ \eej , (9) 

whose associated probabilities q^- := qi^qi2 • • 'qi^ satisfy 
the constraint, 

2-n(S{p)+S) ^ 2-"(^('')"<5) ^ (10) 

where S{p) = — Tr[/9log2p] is the von Neumann entropy 
of p (as in the classical case [s^, the states je^) defined 
above can be thought as those which, in average, contain 
the symbol je^) almost nqi times). Identifying with C 
the set of those vectors ^ = (£i, ^2, • • • , ^n) which satisfies 
Eq. pll)l . the projector P on T-Lujl can then be expressed 
as 

while the average state p®" is clearly given by 

p^-^Y.q^\e^){e^. (12) 

I 

By construction, the two operators satisfy the inequali- 
ties 

p 2-n(S(p)+s) ^ Pp'^'^p ^ p 2-"('S'(p)-'5) _ (13) 

Furthermore, it is known that the probability that E will 
emit a message which is not in H^^p is exponentially de- 
pressed [2§|. More precisely, for all e > it is possible 
to identify a sufficiently large no such for all n ^ no we 
have 

Tr[p®"(l -P)] < e . (14) 

Typical subsets can be defined also for each of the 
product states of Eq. (O, associated to each codeword 
at the output of the channel. In this case the definition 
is as follows 01 : first for each j G ^ wc define the spectral 
decomposition of the element pj, i.e. 

p.-E^il4>(4l> (15) 

k 

where |e^) are the eigenvectors of pj and the corre- 
sponding eigenvalues (notice that while (e^|e^/) = <5fcfc' 
for all k,k' and j, in general the quantities (e;[|e],,) are 
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a-priori undefined). Now the spectral decomposition of 



the codeword is provided by, 



(16) 



where for k := (fci, • • • , fc„) one has 



14:) . 



(17) 



Notice that for fixed j the vectors |eg ■*) are an orthonor- 

mal set of "H®"; notice also that in general such vectors 
have nothing to do with the vectors |e^) of Eq. 

Now the typical subspace T-L^jpii) of Pj defined as 
the linear subspace of "H®" spanned by the le^'') whose 
associated satisfy the inequality, 



2-n(S(p)-x(£)+5) ^ ^ 2-"('S'(p)-x(£)-5) ^ (^g) 



with 



X{£) := S{p) - Y^P]S{pj) , 



(19) 



being the Holevo information of the source £. The pro- 



jector on HtypU) '^^^ then be written as 



(20) 



where IC:' identify the set of the labels k which satisfy 



(i) 



Eq. (HHD. 

We notice that the bounds for the probabilities A- 

k 

do not depend on the value of j which defines the se- 
lected codeword: they are only function of the source 
£ only (this of course does not imply that the subspace 

'Htyp{j) will not depend on j). It is also worth stressing 
that since the vectors je^;'''} in general are not orthogo- 



B. Decoding and Shannon's averaging trick 

The goal in the design of a decoding stage is to identify 
a POVM attached to the code C that yields a vanishing 
error probability as n increases in identifying the code- 
words. How can one prove that such a POVM exists? 
First of all let us remind that a POVM is a collection of 
positive operators {Xj, Xq = 1 — Sjec -^J '■ ^ S C}. The 
probability of getting a certain outcome j' when measur- 
ing the codeword pj is computed as the expectation value 
Tr[Xj,pj] (the outcome associated with Tr[Xo/Oj] corre- 
sponds to the case in which the POVM is not able to 
identify any of the possible codewords). Then, the error 
probability (averaged over all possible codewords of C) 
is given by the quantity 



(22) 



Proving that this quantity is asymptotically null will be 
in general quite complicated. However, the situation sim- 
plifies if one averages Perr(C) with all codewords C that 
the source £ can generate, i.e. 



E^(c) 

c 



Perr (C) 



(23) 



P(C) being the probability defined in Eq. ([8]). Proving 
that {Perr) nullifies for n oo implies that at least one 
of the codes C generated by C allows for asymptotic null 
error probability with the selected POVM (indeed the 
result is even stronger as almost all those which are ran- 
domly generated by C will do the job). In Refs. [2|,Q the 
achievability of the Holevo bound was proven adopting 
the pretty good measurement detection scheme, i.e. the 
POVM of elements 



PP^P 



[E 

hec 



PPnP 



^ , (24) 

hec hec 
Xo = (25) 
Jec 

where P is the projector ([TT|) on the typical subspace of 
the average state of the source, for j G C the Pj are the 
nal with respect to the label j, there will be a certain projectors ^ associated with the codeword p^. With 



overlap between the subspaces 'H[yp{j). The reason why 
they are defined as detailed above stems from the fact 
that the probability that pj will not be found in 'Hlyp{j) 
(averaged over all possible realization of pj) , can be made 
arbitrarily small by increasing n, e.g. see Ref. [3]. More 
precisely, for fixed S > 0, one can show that for all e > 
there exists no such that for all n > uq integer one has. 



this choice one can verify that, for given e there exist n 
sufficiently large such that Eq. (|23| yields the inequal- 
ity 



(Perr) 4e + (A^ - 1) 2-"(x(^)-25) 



(26) 



PJ Tr[pj(l - Pj)] < e 



(21) 



This implies that as long as — 1 is smaller than 
2-"(x(f')-25) gj_jg gg^j^ bound the (average) error proba- 
bility close to zero. 



where pj is the probability 
emitted the codeword pj. 



that the source £ has 



IV. THE SEQUENTIAL DETECTION SCHEME 

In this section we formalize our detection scheme. 
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As anticipated in the introduction, the idea is to deter- 
mine the value of the label j associated with the received 
codeword pj, by checking whether or not such state per- 
tains to the typical subspace of the element j of the se- 
lected code C. 

Specifically we proceed as follows 

• first we fix an ordering of the codewords of C yield- 
ing the sequence ji , j2 , , ■ ' ■ : Jn with j„ G C for 
all u = 1, • • • , -/V (this is not really relevant but it 
is useful to formalize the protocol); 

• then Bob performs a YES/NO measurement that 
determines whether or not the received state is the 
typical subspace of the first codeword ji^; 

• if the answer is YES the protocol stops and Bob 
declares to have identified the received message as 
the first of the list (i.e. ji); 

• if the answer is NO Bob, performs a YES/NO mea- 
surement to check whether or not the state is in the 
typical sub of j2 ; 

• the protocol goes on, testing similarly for all N pos- 
sibilities. In the end we will either determine an 
estimate of the transmitted j or we will get a null 
result (the messages has not been identified, corre- 
sponding to an error in the communication). 

We now better specify the YES/NO measurements. In- 
deed, as mentioned earlier, we have to "smooth" them to 
account for the disturbance they might introduce in the 
process. For this purpose, each of such measurements 
will consist in two steps in which first we check (via a 
von Neumann projective measurement) whether or not 

the incoming state is in the typical subspace T-l^yp of the 
average message. Then we apply a von Neumann pro- 
jective measurement on the typical subspace 'Htypiii) of 
the i-th codeword of Bob's list (see Fig. [T|). Hence, the 
POVM elements are defined as follows. The first element 
El tests if the transmitted state is in 'H|yp(ji), so it is 
described by the (positive) operator 



El 



where for any operator Q the symbol stands for 
6 := pep , 



(27) 



(28) 



P being the projector of Eq. pT|) . Similarly the remain- 
ing elements can be expressed as follows 



E2 

Ei 



= (1 - P-. )p^ (a - P- ) 



(29) 



= (1 - P-. )(]i -p^) p-^ (a - P-. )(i - P- ) , 

^ 31 32' 33 ^ 32'^ Jl' ' 



(see Appendix 1X1 for an explicit derivation). A compact 
expression can be derived by writing 



Eu = MlMu , 



(30) 



where 



Mu := P- P 



"3^-1 ^3^-2 ^31 ' 

with Qj being the orthogonal complement of P^, i.e 
Q-:=^-Pl- 



(31) 



(32) 



With such definitions the associated average error prob- 
ability (|23)) can then be expressed as. 



N 



{-Perr ) 



Jir-- ,3n 



u=l 
N-1 



^-^EpiI: 1>^(PJ)] , (33) 



e=o 



where we used the fact that the summations over the var- 
ious ji are independent. In writing the above expression 
we introduced the following super-operator 



(34) 



which is completely positive and trace decreasing, and 
we use the notation $^ to indicate the £-fold concatena- 
tion of super-operators, e.g. $^(-) = $($(•)). It is worth 
noticing that the possibility of expressing (Perr) in term 
of a single super-operator follows directly from the av- 
erage we have performed over all possible codes C. For 
future reference we find it useful to cast Eq. p3p in a 
slightly different form by exploiting the the definitions of 
Eqs. ([TS]) and (pn)) . More precisely, we write 

N-l 



N-1 



E E E E "^^^^ 



^=0 3jl,-Jt k fe'6/Cj 



N 



X 



(eV^IQ- •••Q- leV 

^ fe' '^Ji ^Je' k 



{3)\ 



(35) 



V. BOUNDS ON THE ERROR PROBABILITY 



^ It is worth stressing that in Ref. Il5f this test was implemented 
by performing a series of rank-one projective measurements on 
to a basis of the subspace. 



In this section we derive an upper limit for the er- 
ror probability p3|) which will lead us to the prove the 
achicvability of the Holcvo bound. 
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Specifically, we notice that 

EE 



k I ^ fc' '^Ji ' fc ' I 



k k'eK^ 



^ E AS?' \^^~^. 

k<^K^ 



k£K^ 



^lE Af (eflQ----Q^lef)l^ 



keK:; 



(36) 



where the first inequality follows by dropping some pos- 
itive terms (those with k ^ k'), the first identity simply 
exploits the fact that the X^^ are normalized probabili- 
ties when summing over all fc, and the second inequality 
follows by applying the Cauchy-Schwarz inequality. Re- 
placing this into Eq. ([35]) we can write 



1 {-Perr) ^ 



(37) 



N 



This can be further simplified by invoking again the 
Cauchy-Schwarz inequality this time with respect to the 
summation over the j, ji, ■ • • i-e. 

_ E^ PjPn---Pn I^IPJPJPJ Qn---Qn 
jji,--- Jt 

>\ E PjPj.---Pn^'iPjPlPl Qn---QlM 



3,31, ■■■ ,31 



= (Tr[Wi Q']y , 
where for q integer we defined 

3 

2 E PI Qj. = S - ^0 



(38) 

(39) 
(40) 



(notice that Wq is not p*^", e.g. see Eq. (|B1[) ). Therefore 
one gets 



N-l 



(41) 



i=0 



To proceed it is important to notice that the quantity Q 
is always positive and smaller than 1, i.e. 



1 > Q ^ . 



(42) 



Both properties simply follow from the identity 

Q = P(l - Y^PlPj^P = ^[E (1 - ^;)]^ ' (43) 

I 3 

and from the fact that 1 > 1 — P7 ^ 0. We also notice 



that 



(44) 



where the last inequality is obtained by observing that 

(1) 

the typical eigenvalues are lower bounded as in 

Eq. (|18p . From the above expressions we can conclude 
that the quantity in the summation that appears on the 
Ihs of Eq. (I^Tj) is always smaller than one and that it 
is decreasing with £. An explicit proof of this fact is as 
follows 

s$ Tr[iyi Q^] = Tr[v/Wi Q y^] 

Tr[v/m 1 VW^] = Tr[VFi Q^"^] , 

where we used the fact that the square root of a non 
negative operator can be taken to be non negative too (for 
a more detailed characterization of Wq see Appendix |B| . 
A further simplification of the bound can be obtained by 
replacing the terms in the summation of Eq. (j4ip with 
the smallest addendum. This yields 



— 2 — 

where, using the fact that 1 = 1 = P, we defined 

Af-l 



(45) 



A := Tr[M^iQ^-i] = ^ ( ) (-1)^ (46) 
:= Tr[M/i P . (47) 



It turns out that the quantities fz defined above are pos- 
itive, smaller than one, and decreasing in z. Indeed as 
shown in the Appendix [C] they satisfy the inequalities 

O^fzi^fo 2-"^('^(^'-2<s) fQj. ^jj integer z, (48) 

and. for each given e, there exists a sufficiently large uq 
such that for n ^ no 



(49) 



Using these expressions, we can derive the following 
bound on A, i.e. 



N~l 



N-l 



^ = /o + E 

z=l 

^ /o - E ( ' )^ = 2/0 - E ( ' 

Z=l 2=0 

^ Vo-foJ2 (^-^)2-"^W^)-2^) 



/o [2_(l + 2-"W^)-2^))^-i] 



(50) 
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where in the first inequality we get a bound by taking all 
the terms of fc 1 with the negative sign, the second from 
(|48ll . Now, on one hand if N is too large the quantity 
on the rhs side will become negative as we are taking the 
A'^ power of a quantity which is larger than 1. On the 
other hand, if TV is small then for large n the quantity in 
the square parenthesis will approach 1. This implies that 
there must be an optimal choice for N in order to have 
[2 — (1 + 2~"('<:(^'~2'^))^~i] approaching one for large n. 
To study such threshold we rewrite Eq. ([50)1 as 

A ^ /o[2 - Y{x = 2>^(^)-2^ y = N, n)] , (51) 

where we defined 

+ . (52) 

We notice that for x,y ^ 1, in the limit of n — > oo the 
quantity \og[Y(x, y, n)] is an indeterminate form. Its be- 
havior can be studied for instance using the de I'Hopital 
formula, yielding 

lim log[F(.T,y,n)] = lim f^V\ (53) 

n-i-oo logy n— s-oo \x/ 

This shows that if y < a; the limit exists and it is zero, i.e. 
liuin^oo Y{x,y,n) = 1. Vice-versa for y > x the limit 
diverges, and thus limn^oo Y{,x,y,n) = co. Therefore, 
assuming N ~ 2"^ , we can conclude that as long as 

R < xi£) - 2S , (54) 

the quantity on the rhs of Eq. ([?T|) approaches /o as n 
increases (this corresponds to having y < x in the Y 
function). Reminding then Eq. we get 

1 - {P,rr) ^ > /2 > |1 - e|2 > 1 - 2e , (55) 
and thus 

(Perr) < 2e . (56) 

On the contrary, if _R > x(^) ~ 2(5, the lower bound on 
A becomes infinitely negative and hence useless to set a 
proper upper bound on (Perr)- 

To summarize, we have shown that adopting the se- 
quential detection strategy defined in Sec. |IV]we can con- 
clude that it is possible to send N = 2"^ messages with 
asymptotically vanishing error probability, for all rates R 
which satisfy the condition ■ 

VI. CONCLUSIONS 

To summarize: the above analysis provides an ex- 
plicit upper bound for the averaged error probability of 
the new detection scheme (the average being performed 
over all codewords of a given code, and over all possi- 
ble codes). Specifically, it shows that the error proba- 
bility can be bound close to zero for codes generated by 



sources £ which have strictly less than 2"^^^^^ elements. 
In other words, our new detection scheme provides an al- 
ternative demonstration of the achievability of the Holevo 
bound 0. 

An interesting open question is to extend the technique 
presented here to a decoding procedure that can achieve 
the quantum capacity of a channel [sTI - fs^ . 
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Appendix A: Derivation of the POVM 

Here we provide an explicit derivation of the 
POVM associated with our iterative measurement 
procedure. It is useful to describe the whole process as 
a global unitary transformation that coherently trans- 
fers the information from the codewords to some external 
memory register. 

Consider, for instance, the first step of the detection 
scheme where Bob tries to determine whether or not a 
given state j^E') e f^®" corresponds to the first codeword 
pj^ of his list. The corresponding measurement can be 
described as the following (two-step) unitary transforma- 
tion 

|*)|00)b, ^ P|*)|01)bi +(1-P)|*)|00)bi 

^ pj^p\mi)B, + {1 - Pj^)pmoi)B, 

+ (1-P)|vI/)|00)b,,(A1) 

where Bi represents a two-qubit memory register which 
stores the information extracted from the system. Specif- 
ically, the first qubit records with a "1" if the state IvP) 

belongs to the typical subspace 'H^^p of the average state 
of the source (instead it will keep the value "0" if this is 
not the case). Similarly, the second qubit of Bi records 
with a "1" if the projected component Pj^*) is in the 
typical subspace V-lyliji) of pj^. Accordingly the joint 
probability of success of finding j?') in Tilyp and then in 
'^iylGi) is given by 

Vii'f) = {^\PPj^P\^) , (A2) 

in agreement with the definition of Ei given in Eq. (|27p . 
Vice- versa the joint probability of finding the state j^*) 
in in Hj"^ and then not in "Ulyliji) is given by (4'|P(1 — 
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Pj^)P|^'} and finally the joint probability of not finding 

5') in in Tilyp is — ^1^)- Let us now consider the 
second step of the protocol where Bob checks wether or 

not the message is in the typical subspace "Hj^pQ^) of 
pj^ . It can be described as a unitary gate along the same 
lines of Eq. (|A1|) with Pj^ replaced by Pj^ , and Bi with a 
new two-qubit register B2 • Notice however that this gate 
only acts on that part of the global system which emerges 
from the first measurement with Bi in |01}. This implies 
the following global unitary transformation, 

|*)|00)bJOO)b, ^ P- P|*)|11)bJ00)s, 

+[pj^p{i-pjjpmm)B,mB, 

+ (1 - PjJP{l - P-JP|v&)|01)bJ01)b, 
+ (1-P)(1-/^JP|vI/)|01)bJ00)b.] 
+ (1-P)|*)|00)bJ00)b,, (A3) 

which shows that the joint probability of finding \^!) in 
'^iylih) (after having found it in H^yp, not in V^ll^ji), 
and again in ulyp) is 

V2{^) = |P(1 - PjJPPj^_P{l - Pj^)Pm , (A4) 



in agreement with the definition of E2 given in Eq. ([29| . 
Reiterating this procedure for all the remaining steps one 
can then verify the validity of Eq. (PH)) for all u ^ 2. 
Moreover, it is clear (e.g. from Eq. (|A2[) and (jA4p ) that 
it is a quite different POVM from the conventionally used 
pretty good measurement 0, . 



Appendix B: Some useful identities 

In this section we derive a couple of inequalities which 
are not used in the main derivation but which allows us 
to better characterize the various operators which enter 
into our analysis. First of all we observe that 



where we used Eq. (|T8)) . We can also prove the following 
identity 

^ p(J _ p<Sm 2n('S'(p)-x(£)+'5))p 

^ P (1 _2-"('^(^)^2i)) ^ (B2) 



which follows by using Eq. (|T3|). Notice that due to 
Eq. (|Bip this also gives 



PWqP ^ P 2""(''(^)-2'5) . 



(B3) 



Wo=Y.PjPj^P^" 2 



n{S{p)-xi£)+S) 



(BI) 



Appendix C: Characterization of the function/2 

We start deriving the inequalities of Eq. P5)) first. To 
do we observe that for all e' positive we can write 

PJ Tr[pj(l - Pj)P] ^ J2 PJ Tr[Pj(l - Pj)] < ^' , 



where the first inequality follows by simply noticing that 
/9j(l— Pj) is positive semidefinite (the two operators com- 
mute), while the last is just Eq. ([21]) which holds for 
sufficiently large n. Reorganizing the terms and using 
Eq. this finally yields 

/o = Tr[I^i P] >Y. PJ Trbj^] - e' 
J 

= Tr[p®"P] - e' > 1 - 2e' , (CI) 

which corresponds to the lefttmost inequality of Eq. 
by setting e — 2e'. The rightmost inequality instead fol- 
lows simply by observing that 

/o = Tr[VFi P] < Tr[VKi] = 5I^'jTr[Pjpj] ^ 1 . (C2) 



which follows by the following chain of inequalities. 



E |ef )(ef |Af 2"(^('')-x(^)+^) 

/-^ ' k ' ^ 'k ' k 
1 k 

I 

p®n 2n(S(p)-x(£)+S) ^ 



To prove the inequality (pS)) we finally notice that for 
z ^ 1 we can write 

= Tr[M^i PW^] = Ty[Wi W§] 

= TyI^/WiW^WqW^ ^/w'l] 

^ Tr[yWi W^P yWi]2-"('^(^)"2*) 

s$ Tr[yW^ W^W^ yw^]2-"(x(^)-25) 

where we used the fact that the operators operators Wi, 
Wo are non negative. The expression p8|) then follows 
by simply reiterating the above inequality z times. 
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